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ABSTRACT 

The correlation function of galaxy clusters has frequently been used as a test of 
cosmological models. A number of assumptions are implicit in the comparison of theo- 
retical expectations to data. Here we use an ensemble of ten large N-body simulations 
of the standard cold dark matter cosmology to investigate how cluster selection crite- 
ria and other uncertain factors influence the cluster correlation function. Our study is 
restricted to the idealized case where clusters are identified in the three dimensional 
mass distribution of the simulations. We consider the effects of varying the definition 
of a cluster, the mean number density (or equivalcntly the threshold richness or lumi- 
nosity) in a catalogue, and the assumed normalization of the cosmological model; we 
also examine the importance of redshift space distortions. We implement five different 
group-finding algorithms and construct cluster catalogues defined by mass, velocity 
dispersion or a measure of X-ray luminosity. We find that different cluster catalogues 
yield correlation functions which can differ from one another by substantially more 
than the statistical errors in any one determination. For example, at a fixed number 
density of clusters, the characteristic clustering length can vary by up to a factor of 
^ 1.5, depending on the precise procedure employed to identify and select clusters. 
For a given cluster selection criteria, the correlation length typically varies by ~ 20% 
in catalogues spanning the range of intercluster separations covered by the APM and 
Abell (richness class >1) catalogues. Distortions produced by peculiar velocities in red- 
shift space enhance the correlation function at large separations and lead to a larger 
clustering length in redshift space than in real space. The sensitivity of the cluster 
correlation function to various uncertain model assumptions substantially weakens 
previous conclusions based on the comparison of model predictions with real data. 
For example, some of our standard cold dark matter cluster catalogues agree better 
with published cluster clustering data (particularly on small and intermediate scales) 
than catalogues constructed from similar simulations by Bahcall & Cen and Croft & 
Efstathiou. Detailed modelling of cluster selection procedures including, for example, 
the effects of selecting from projected galaxy catalogues is required before the cluster 
correlation function can be regarded as a high precision constraint on cosmological 
models. 
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1 INTRODUCTION 

The clustering strength of rich galaxy clusters has long been 
used as a constraint on models of large scale structure. The 
two-point correlation function, ^cc, was first estimated for a 
sample of about 100 rich Abell clusters by Bahcall & Soneira 
(1983) and by Klypin & Kopylov (1983) who noted that 
clusters have a larger clustering amplitude than galaxies. 



This difference has a natural explanation in theories in which 
large scale structure grows by gravitational amplification of 
small fluctuations in an initially Gaussian density field. In 
such theories, collapsed objects form near peaks of the initial 
density field and a clustering pattern which depends on the 
height of the peak is imprinted at the epoch of formation 
(Kaiser 1984, Barnes et al. 1985). 
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Although the statistics of rare peaks provide an appeal- 
ing explanation for the different clustering strength of galax- 
ies and clusters, it soon became apparent that the early es- 
timates of ^cc were inconsistent with the predictions of the 
cold dark matter model, the paradigm of gravitational clus- 
tering theories (Davis et al. 1985). Using N-body simulations 
of this model. White et al. (1987) calculated a cluster clus- 
tering length, ro — ll/i~^Mpc (ro is defined as the separation 
at which ^cc = 1), whereas Bahcall & Soneira (1983) had 
obtained ro ~ 25/i~^Mpcfor Abell clusters of richness class 
R> 1. Bahcall & Soneira's estimate helped to motivate an 
alternative explanation for the formation of structure, based 
on fluctuations seeded by cosmic strings (Turok 1983, Turok 
& Brandenberger 1986). 

Sutherland (1988) pointed out that the apparent clus- 
tering amplitude of rich clusters would be artificially en- 
hanced if intrinsically poor clusters which happened to lie 
near the line-of-sight to a rich cluster - and thus appear 
rich in projection - were included in a rich cluster sample. 
A signature of this effect is an anisotropy in the correlation 
function which appears stronger along the line-of-sight than 
in the perpendicular direction. Soltan (1988) and Suther- 
land & Efstathiou (1991) showed that such anisotropies were 
clearly present in Bahcall & Soneira's sample and they, as 
well as Dekel et al. (1989), argued that correcting for this ef- 
fect would lower the clustering length of rich Abell clusters 
to ~ 14/i~^Mpc. Nevertheless, Postman, Huchra & Geller 
(1992), using a sample of 351 Abell clusters of richness class 
R > 0, supported Bahcall & Soneira's original estimate. 

Further progress had to await the construction of new 
cluster catalogues. These finally began to arrive in the early 
1990s. Dalton et al. (1992) and Lumsden et al. (1992) 
constructed the first automated cluster catalogues using 
the positions and magnitudes of galaxies in photographic 
plates scanned with the APM and Cosmos machines re- 
spectively; Lahav et al. (1993), Romer et al. (1994) and 
Nichol, Briel & Henry (1994) constructed cluster catalogues 
using a combination of X-ray and optical data. The ensu- 
ing redshift surveys led to new determinations of ^cc. The 
Oxford group estimated ro = 12.9 ± 1.4/i~^Mpc (Dalton et 
al. 1992) from a sample of about 200 APM clusters, and 
ro = 14.3 ± 1.2/i~'^Mpc from an extended sample of 364 
APM clusters (Dalton et al. 1994); Romer et al. obtained 
ro = 13.7± 2.3/i~^Mpc from an X-ray flux-limited sample of 
128 clusters. 

Unfortunately the new cluster samples have not fully re- 
solved the debate surrounding ^cc. Bahcall & Soneira (1983) 
argued that the measured values of ro depend very strongly 
on cluster richness. Bahcall & West (1992) interpreted the 
discrepancies between different samples as a reflection of 
their different mean cluster richness, rather than as a re- 
sult of contamination in Abell's catalogue (see also Peacock 
& West 1992). N-body simulations by Bahcall & Cen (1992) 
seem to support this view, whereas simulations by Croft & 
Efstathiou (1994) suggest that the dependence of ro on clus- 
ter richness is weak. Furthermore, the Oxford group have 
claimed that the low values of ro that they obtain for APM 
cluster catalogues, although much smaller than Bahcall & 
Soneira's (1983) value are still inconsistent with the stan- 
dard CDM cosmology and favour either CDM models with 
a low mean density and a non-zero cosmological constant or 
mixed dark matter models (Dalton et al. 1992, 1994, Croft 



& Efstathiou 1994). They base this conclusion on Croft & 
Efstathiou's (1994) set of large N-body simulations which, 
they argue, enable theoretical predictions for ^cc to be made 
with better than 10% accuracy over a wide range of scales. 

The work of Bahcall & Cen (1992), Croft & Efstathiou 
(1994) and Watanabe et al. (1994) has highlighted how, as 
the observational data on ^cc improves, the need for pre- 
cise theoretical predictions becomes increasingly important. 
Making theoretical predictions which are relevant to the in- 
terpretation of the data, however, is not straightforward, 
even for well specified models such as CDM and its vari- 
ants. In these models the evolution of the mass density field 
on the relevant scales can indeed be predicted quite accu- 
rately, particularly through large N-body simulations. The 
primary difficulty lies in the uncertain identification of clus- 
ters in the models with the real clusters found in galaxy 
surveys. Bahcall & Cen, Croft & Efstathiou and Watanabe 
et al. , like White et al. (1987), identified "galaxy clusters" 
in their simulations with large mass concentrations in the 
three-dimensional mass distribution. This is, of course, a 
very different procedure from that applied to real data where 
clusters are identified from the projected galaxy distribution 
using relatively complex algorithms. Possible biases in statis- 
tics such as ^cc which might be introduced by this procedure 
remain largely unexplored. 

In this paper, we address the restricted question of how 
the clustering strength of model clusters identified using the 
full three-dimensional mass distribution in N-body simula- 
tions depends on the details of the cluster finding algorithm 
and on the way in which the cluster catalogues are con- 
structed. For definitiveness, we consider only one specific 
cosmological model, the standard CDM model. We find that 
even in this idealized wide range of values of ro is 

obtained from samples selected and analyzed in ways which 
are in principle equally valid approximations to the real situ- 
ation. We therefore conclude that much more detailed mod- 
elling is required before the present data can be used as a 
high precision test of currently popular cosmological models. 

In the following section we describe our simulations and 
methods for constructing cluster catalogues. In Section 3 
we present estimates of the correlation function for these 
catalogues. In Section 4 we compare our results with those 
obtained in previous related studies. We discuss our findings 
and summarize our conclusions in Section 5. 



2 TECHNIQUES 

2.1 Numerical Simulations 

Our analysis is based on an ensemble of 10 CDM simula- 
tions performed with the adaptive AP^M code of Couchman 
(1991). Each simulation represents a comoving periodic box 
of length /box = 256 h~^Mpc Q containing 128'^ particles, 
each of mass ~ 2.2x W^^ Mq. The force softening (for an 
equivalent Plummer potential) was chosen to be ~ 65/i~^kpc 
and remained fixed in comoving coordinates. 

Initial conditions were laid down by perturbing par- 
ticles from a uniform grid using the Zel'dovich approxi- 

* Throughout this paper, we write Hubble's constant as Hq = 
lOO/ikms^^Mpc"^ 
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mation (Efstathiou et al. 1985), and assuming the BBKS 
CDM transfer function for the case of zero baryons and 
Ho = 50kms"^Mpc"^ (Bardeen et al 1986). We defined the 
expansion factor a such that as = a, where as is the hn- 
ear rms amplitude of mass fluctuations in spheres of radius 
8/i~^Mpc. Each simulation was evolved from a = 0.05 to 
a = 0.63 using a timestep Aa = 0.002 and each took ap- 
proximately two days of CPU on a Decstation 5000/240. 
The timestep and initial redshift were chosen after running 
a scries of trial simulations in which these parameters were 
varied. Although the cluster correlation function turned out 
to be insensitive to these variations, we found that adopting 
cither a later starting time or a larger timestep made a sig- 
nificant change to the abundance of clusters as a function of 
both mass and temperature, while adopting earlier starting 
times and smaller timesteps left them essentially unchanged. 

In the following analysis we identity the present day 
with the epoch at which ug — 0.5 or 0.63. These values were 
chosen to span the range of normalizations that are consis- 
tent with the mass and abundance of rich clusters of galaxies 
(White, Efstathiou & Frenk 1993) and are lower than would 
be required for consistency with the COBE microwave back- 
ground anisotropy measurements in the absence of a ten- 
sor mode contribution to the anisotropy (Smoot et al. 1992, 
Wright et al. 1994). Our simulations have a slightly smaller 
volume, but better mass and spatial resolution than those 
of Croft & Efstathiou (1994). 

2.2 Group-Finders 

In order to investigate the effect of varying the criteria by 
which clusters are defined and selected, we used five differ- 
ent group-finding algorithms to identify clusters of particles 
in the N-body simulations. In all cases we considered only 
groups with 8 or more particles, corresponding to a mass 
M > 3.5 X IQ^^H-'^Mq. 

The first was the standard friends-of-friends algorithm 
(Davis et al. 1985) that links together particles closer than 
some specified separation. We adopted a linking length of 
h = 0.2 times the mean interparticle separation for which 
groups have typical mean overdensities of ~ 200. We will 
refer to these groups as EOF. 

Our second group-finder was the "spherical overdensi- 
ties" algorithm described in more detail in Lacey & Cole 
(1994). This algorithm estimates tlic local density at the 
position of each particle by finding the distance to its tenth 
nearest neighbour. Particles axe ranked by local density and, 
starting with the particle in the densest environment, they 
arc used as centres about which spheres are infiated until the 
mean enclosed overdensity falls below a spccficd threshold. 
Overlapping groups are merged and centres recalculated un- 
til they lie within 0.2 h~ Mpc of the centre-of-mass of the 
grouped particles. We adopted an overdensity threshold of 
180. We shall refer to the resulting groups as SO. 

Our third algorithm is a variant of the spherical over- 
densities algorithm. Each particle is assigned an "X-ray 
luminosity", Lx,z = PiVi, where pi and Vi denote esti- 
mates of the local density and velocity dispersion obtained 
from the ten nearest neighbours. The motivation for this 
choice is that, when summed over a group of particles, the 
total "luminosity" will depend on density and tempera^ 
ture in the same way as bremsstrahlung emission, namely 



oc j p^T^ ''d?'r. We calculate the mean X-ray luminos- 
ity density by summing all of the individual particle values 
and dividing by the volume of the simulation box. Then 
the particles are ranked by X-ray luminosity and spheres 
infiated until the luminosity contrast in the spherical vol- 
ume, JL^/Lx = 10*. Although our simulations lack the spa- 
tial resolution to define the X-ray emitting regions well, this 
group-finder does give a higher weighting to the centres of 
the clusters and therefore has an effect which is qualitatively 
similar to that which we seek to represent. We will refer to 
groups identified in this way £is SOX. 

The final two group-finders we used are identical to 
those adopted by Croft & Efstathiou (1994) and allow us to 
make a direct comparison with their results. Tiie algorithm 
first locates potential group centres using the EOF algorithm 
with 6 = 0.1 to find tight knots of particles. Spheres of radius 
either 0.5 or 1.5 /i~^Mpc are constructed about these centres 
and, after merging overlapping spheres and recentring, the 
particles within this radius are considered as a group. We 
denote these as CE(0.5) and CE(1.5) respectively. 

In summary, we have chosen to investigate five differ- 
ent group-finding algorithms. The first, EOF, is the standard 
group-finder used extensively in previous analyses of N-body 
simulations. This is a simple and elegant algorithm which 
picks out most of the clusters identified as such by eye, al- 
though occasionally it classes two distinct lumps as a single 
group if they are linked by a tenuous bridge of particles. Our 
second algorithm, SO, avoids this situation by working with 
the average densities within spheres. The shortcoming of this 
method is that it unnaturally forces the boundaries of groups 
to be spheres. Our third algorithm, SOX, attempts to emu- 
late selecting clusters by X-ray emission. Given the nature 
and resolution of our simulations it is only approximate, but 
it does succeed in giving increased weight to groups contain- 
ing dense cores. The fourth and fifth group-finders, CE(0.5) 
and CE(1.5), we use for comparison with Croft & Efstathiou 
(1994) who adopted them as idealized 3-D versions of the 
galaxy counting algorithms that were used in constructing 
the APM and Abell galaxy cluster catalogues respectively. 

2.3 Cluster Selection 

The next step is to construct cluster catalogues from each set 
of groups identified by our different algorithms. In the real 
world, the abundance of clusters in a sample is determined 

by setting a threshold in apparent optical richness or X- 
ray flux. In our models, for each choice of group-finder, we 
ranked the groups identified in all Na\m = 10 simulations 
according to mass (M), velocity dispersion (v), or X-ray 
luminosity (Lx). We then selected the A^cius highest ranked 
clusters to produce a cluster catalogue in each simulation 
for which the mean intercluster separation is given by 

rfc = (iVsim/iVclus)'/'«box. (1) 

We investigate how the resulting cluster correlation function 
depends on abundance, as parameterized by do, as well as 
on the other details of the cluster selection process. 

Cluster X-ray luminosities were defined in one of two 
ways. For the SOX groups, we simply summed the lumi- 
nosities assigned to the individual particles in the group. For 
clusters obtained with other group-finders we first estimated 
a velocity dispersion (from the measured mass assuming an 
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Figure 1. A 50/i~^Mpc thick slice through one of the ten sim- 
ulations with erg = 0.63. The clusters were selected using the 
frieiids-of-friends (FOF) group-finder and adopting a lower mass 
cutoff such that the mean cluster separation = 30/i~^Mpc. 
Particles belonging to clusters are shown as dots and each cluster 
is delineated by a circle around its centre with radius equal to 1.5 
times the true distance to the most distant cluster particle. 
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Figure 2. As Fig. 1, but with X-ray luminosity rather than mass 
as the selection statistic for the FOF groups. 



isothermal density distribution), converted this into an X- 
ray temperature, Tx, by assuming that the specific kinetic 
energy in the dark matter is equal to the specific thermal 
energy in the gas (Evrard 1990; Navarro, Prenk & White 
1995), and inferred an X-ray luminosity from the empirical 
Z/x — Tx relation. For the last step we adopted the mean 
relation given by David et al. (1993), Lx oc T^'^. (Since we 
are only interested in ranking the clusters by X-ray luminos- 
ity, the constant of proportionality is immaterial.) We then 
assumed a Gaussian scatter with variance varying linearly 
with logioTx. This gave criogioix = 0.58 at logioTx = 0.3, 
and ciogio^x = 0-31 at logioTx = 0.9. This procedure es- 
sentially scrambles up a ?;-sclcctcd catalogue by introducing 
some lower velocity dispersion clusters at the expense of 
higher velocity dispersion ones. 

For each of our group-finders and selection statistic, cat- 
alogues with 20 < dc < 70 /j~^Mpc were created for both 
(Tg = 0.5 and ag = 0.63. Figs. 1 and 2 show a slice through 
one of the simulations. In both cases, clusters were found 
using FOF, but the clusters in Fig. 1 were selected by mass 
whereas those in Fig. 2 were selected by "X-ray" luminosity. 
The "X-ray" sample can be seen to have some very small 
clusters that have come in at the expense of more extended 
objects in the mass selected sample. 

In summary, we have used five different group-finders: 
FOF, SO, SOX and the two employed by Croft & Efstathiou 
(1994), CE(0.5) and CE(1.5). From the FOF and SO groups, 
cluster catalogues selected according to mass, velocity dis- 
persion and X-ray luminosity were produced. For the SOX 
group-finder, clusters were selected either by X-ray lumi- 
nosity or by velocity dispersion. The cluster catalogues pro- 
duced from the CE(0.5) and CE(1.5) groups were selected 
according to mass only, in the same way as the clusters an- 
alyzed by Croft & Efstathiou (1994). In what follows, we 
indicate with a subscript the statistic used for constructing 
a catalogue from the list of groups obtained with a partic- 
ular group-finder. Thus, for example, FOFm denotes clus- 
ters identified with the friends-of-friends group-finder and 
selected according to mass. 



2.4 Correlation function estimator 

Cluster correlation functions in real space, (,cc{r), and in 
redshift space, Ccc(s), were obtained for each of our cluster 
catalogues. In the latter case, the simulations were projected 
along one of the principal axes and the component of each 
cluster's peculiar velocity along that axis was added to the 
Hubble velocity. We used the estimator 



^cc(a;) = 



nlVdV 



-1, 



(2) 



where x denotes either r or s, iVp is the number of cluster 
pairs with separation in a bin of volume dV centred at x, 
ric is the number density of clusters in the catalogue, and V 
is the total volume. Estimates from all 10 simulations were 
averaged and the scatter amongst them used to obtain the 
error in ^cc 

Our correlation functions are not well fit by a single 
power-law over the entire range of pair separations sampled 
in our models. Thus, to estimate the correlation length, xo, 
we made a two-parameter fit of the form, 
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Figure 3. Real space correlation functions for SO groups in a CDM model with as = 0.63. The different symbols correspond to catalogues 
selected according to mass (open circles), velocity dispersion (filled circles), and "X-ray" luminosity (open triangles). The lines in each 
panel link the triangles and appear also on Fig. 4 for ease of comparison. 



(3) 

over a limited range in x, near the value where ^cc = 1. By 
fitting over a narrow range in x, our inferred values of xo 
do not depend strongly on the value of the slope, 7, but our 
estimates of 7 are only applicable in this limited range of 
pair separations. The actual range in x used depends on the 
amplitude of ^cc, but is well approximated by the limits 

^+0.5<log,,(.)<^ + 1.0, (4) 

where dc and x are measured in /i~^Mpc. This range corre- 
sponds to four or five radial bins around x = xo- The vari- 
ances of the fit parameters wore recovered from the diagonal 
elements of the covariance matrix. 



3 RESULTS 

Tables 1 and 2 give correlation lengths and power-law slopes 
for real and redshift space correlation functions, for a selec- 
tion of our cluster catalogues. (Error bars were obtained as 
described in section 2.4). Full correlation functions arc plot- 
ted in Figs. 3 and 4 for samples constructed using a subset 
of our selection statistics and spanning a wider range of val- 
ues of dc than those shown in the tables. We now discuss 
several trends apparent in the data. We focus exclusively on 
estimates of the correlation length, and ignore variations in 
the slope, 7, since it is most often the former that is used 
to compare models with the data. (For reference, we list our 
estimates of 7 in the tables, recalling that they refer exclu- 
sively to the region where ^cc — !•) 
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Figure 4. Redshift space correlation functions for SO groups in a CDM model with crs = 0.63. The different symbols correspond to 
catalogues according to mass (open circles), velocity dispersion (filled circles), and "X-ray" luminosity (open triangles). The solid lines 
are the same as those in Fig. 3 and are repeated here so that the increase in caused by using redshift distances rather than true 
distances may be seen clearly. 



3.1 Dependence of the real space correlation 

function on cluster selection and abundance 

We first consider the effect on ^cc of varying the procedure 
for identifying and selecting clusters in real space. For each 
choice of selection statistic (M, v and Lx), FOF and SO clus- 
ters give consistent results in almost all cases. We conclude 
that for the groups we are considering - the most massive 
in the simulations - the FOF and SO finders essentially pick 
out the same objects. SOX clusters also tend to have simi- 
lar values of ro as FOF and SO clusters, whether they are 
selected by v or Lx. 

Our real space results for CE clusters are in excel- 
lent agreement with those obtained by Croft & Efstathiou 
(1994). Note that whilst the CE(1.5)^ clusters give results 
consistent with those of FOFm and SOm, the CE(0.5)jj^ 



clusters give the smallest values of ro of any mass selected 
clusters. The largest difference between CE(0.5)m and SOm 
clusters is 3.3 /i~^Mpcand occurs for dc = 50/i~^Mpcand 
0-8 = 0.5. 

For any given cluster finding algorithm there are of- 
ten trends either with the selection statistic or with crs- For 
example, when dc is largo, clusters selected by mass have 
larger values of ro than clusters selected by v, with a largest 
difference of 2.8 /i~^Mpc between SOm and FOF„ clusters 
for (78 = 0.5. Clusters selected by X-ray luminosity tend to 
be slightly more weakly clustered than clusters selected by 
mass or velocity dispersion. This is because of the scatter 
in the Lx — Lx relation and the trend of clustering strength 
with velocity dispersion. Thus, for fixed dc and crs, the SOi^ 
and SOX clusters tend to give smaller values of ro than the 
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Table 1. The real space correlation length, ro, and the slope of the correlation function, 7, for a variety of group-finders and cluster 
selection criteria for two different values of mean cluster separation, dc, and normalization, as- Errors are Icr and were estimated as 
described in section 2.4. 



Group-Finder Selection dc = 30 /i~iMpc dc=50h~^Mpc dc = 30/i~iMpc <ic = 50ft~iMpc 
<78 = 0.5 (78 = 0.5 0-8 = 0.63 as = 0.63 



FOF 


Mass 


ro = 


8.50 it 0.15 




10.15 ±0.79 


8.79 ±0.18 




12.01 ±0.58 






7 = 


2.15 ±0. 


,11 


2.13 ±0.57 


2.18 ±0. 


13 


2.63 ± 0.37 


SO 


Mass 


ro = 


8.56 ±0.15 




11.24 ± 0.70 


8.73 ±0.17 




10.80 ± 0.84 






7 = 


2.18 ±0. 


,11 


2.25 ± 0.49 


2.22 ±0. 


11 


2.21 ±0.61 


CE(0.5) 


Mass 


ro = 


8.31 ±0.17 




7.97 ±0.71 


8.29 ±0.17 




9.59 ± 0.52 






7 = 


2.02 ±0, 


12 


1.89 ±0.48 


2.14 ±0. 


12 


3.09 ±0.64 


CE(1.5) 


Mass 


ro = 


8.80 ±0.13 




10.70 ±0.64 


8.84 ±0.16 




11.22 ±0.64 






7 = 


2.27 ±0, 


,09 


2.53 ±0.51 


2.42 ±0. 


12 


2.96 ±0.55 


FOF 


V 


ro = 


8.64 ±0.16 




8.45 ±0.71 


8.75 ±0.20 




10.12 ±0.69 






7 = 


2.21 ±0. 


,12 


2.48 ± 0.73 


2.19 ±0. 


,15 


2.74 ±0.66 


SO 


V 


ro = 


8.54 ±0.14 




9.28 ±0.50 


8.59 ±0.17 




9.89 ± 0.68 






7 = 


2.19 ±0. 


,11 


2.85 ± 0.66 


2.04 ±0. 


11 


2.67 ± 0.65 


SOX 


V 


ro = 


8.26 ±0.18 




9.00 ± 0.76 


8.32 ±0.19 




10.29 ±0.66 






7 = 


1.95 ±0, 


,12 


1.91 ±0.46 


2.17 ±0. 


15 


2.81 ±0.66 


FOF 




ro = 


7.88 ±0.19 




9.14 ±0.73 


7.87 ±0.16 




8.83 ± 0.80 






7 = 


2.33 ±0, 


17 


2.60 ±0.98 


2.11 ±0. 


,14 


1.99 ±0.50 


SO 




ro = 


7.34 ±0.19 




6.41 ± 1.12 


7.72 ±0.17 




9.46 ± 0.62 






7 = 


2.17± 0, 


15 


2.26 ±0.85 


2.27 ±0. 


15 


2.72 ± 0.77 


SOX 


Lx 


ro = 


7.76 ±0.16 




5.28 ± 1.80 


8.05 ±0.18 




10.02 ± 0.70 






7 = 


2.26 ±0. 


,13 


1.53 ± 0.85 


2.28 ±0. 


13 


2.45 ± 0.60 



Table 2. The redshift space correlation length, so, and the slope of the correlation function, 7, for a variety of group- finders and cluster 
selection criteria for two different values of mean cluster separation, dc, and normalization, as- Errors are la and were estimated as 
described in section 2.4. 



Group-Finder 


Selection 


dc 




30/!.-iMpc 




dc = 50/i-iMpc 


dc = 30/!.-iMpc 


dc 


= 50/t-iMpc 






as 




0.5 




as = 0.5 


as = 0.63 


as 


= 0.63 


FOF 


Mass 


SQ 




9.31 ± 0.16 




10.76 ± 0.76 


9.93 ±0.16 


12, 


,33 ± 0.65 






7 




2.46 ±0, 


12 


2.02 ±0.47 


2.29 ±0.09 




2.61 ±0.35 


SO 


Mass 


so 




9.30 ± 0.15 




11.82 ±0.62 


9.81 ±0.16 


12, 


,22 ±0.76 






7 




2.55 ±0, 


,10 


2.59 ±0.47 


2.36 ±0.08 




2.41 ± 0.43 


CE(0.5) 


Mass 


so 




9.25 ±0.16 




10.45 ± 0.53 


9.89 ±0.16 


11, 


,26 ±0.56 






7 




2.30 ± 0. 


,10 


2.58 ± 0.44 


2.04 ± 0.09 




2.64 ± 0.43 


CE(1.5) 


Mass 


so 




9.38 ±0.16 




10.89 ± 0.56 


9.89 ±0.15 


11, 


,90 ± 0.64 






7 




2.52 ±0. 


,09 


2.70 ± 0.47 


2.41 ±0.10 




2.71 ± 0.40 


FOF 


V 


so 




9.31 ±0.14 




9.28 ±0.71 


10.04 ±0.17 


11, 


,07 ± 0.66 






7 




2.51 ±0, 


,11 


2.79 ±0.67 


2.45 ±0.10 




3.12 ±0.62 


SO 


V 


so 




9.59 ±0.16 




10.18 ± 0.53 


10.29 ± 0.15 


12, 


,03 ±0.58 






7 




2.53 ±0, 


12 


3.14 ±0.54 


2.36 ±0.07 




3.05 ±0.34 


SOX 


V 


so 




9.42 ± 0.16 




11.61 ±0.60 


10.32 ± 0.15 


12, 


,02 ±0.56 






7 




2.52 ±0, 


,13 


2.74 ±0.34 


2.41 ± 0.08 




2.99 ± 0.40 


FOF 


Lx 


so 




8.67 ±0.16 




9.39 ± 0.94 


9.19 ±0.15 


11, 


,10 ±0.56 






7 




2.60 ± 0. 


,12 


2.00 ± 0.70 


2.46 ±0.10 




2.82 ± 0.59 


SO 


Lx 


so 




8.53 ± 0.14 




7.94 ± 0.49 


9.44 ±0.15 


11, 


,65 ± 0.62 






7 




2.32 ± 0. 


,12 


2.69 ± 0.47 


2.45 ±0.10 




2.74 ± 0.33 


SOX 


Lx 


so 




8.72 ± 0.14 




9.21 ±0.59 


9.18 ±0.13 


10, 


,98 ± 0.69 






7 




2.53 ±0. 


,13 


2.37 ±0.55 


2.33 ± 0.09 




2.64 ±0.55 



other catalogues. At dc = , where the statisti- 

cal errors are small, the differences between the correlation 
lengths measured from the various cluster catalogues can ex- 
ceed 5a and at dc ~ 50/i^^Mpcthe minimum and maximum 
are separated by at least 4(7. Our data also show a weak 
but significant trend for ro to increase with increasing as- 



Typically, with the same cluster definition at erg = 0.5 and 
as = 0.63, the differences in ro are l/i~^Mpcor less. 

A clear trend in our real space data is the tendency 
for the correlation length to increase with increasing mean 
cluster separation. This trend is stronger for the larger value 
of as- Some illustrative cases are plotted in Fig. 5 (for v- 
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Figure 5. Variation of the real space correlation length, ro, with 
mean intcrclustcr separation, dc for a selection of cluster cata- 
logues. Real space correlation functions were calculated for cata- 
logues selected according to 1-D velocity dispersion, v, for SO (cir- 
cles) and SOX (triangles) clusters. Open symbols are for erg = 0.5 
and filled symbols for 0.63. 

selected clusters). The data are reasonably well fit by a linear 
relation, although for dc > 40/i~^Mpcthc uncertainties arc 
too large to rule out a flatter trend as advocated by Croft 

6 Efstathiou (1994). A more detailed comparison with this 
and other work is made in the next section. For our SO^ 
clusters, the ro — dc relation can be approximately fit by a 
linear relation of the form: 

ro = (0.056 ± 0.014)dc + (6.36 ± 0.69) /i"^Mpc (5) 
and 

ro = (0.090 ± 0.009)dc + (5.82 ± 0.44) /i"^Mpc (6) 

for as = 0.5 and 0.63 respectively. 

In summary, the real space correlation length of rich 
clusters identified in three dimensions is only weakly depen- 
dent on the normalization of the power spectrum, but it 
can vary considerably depending on the procedure used to 
define a cluster catalogue and on the abundance of the ob- 
jects under consideration. This variation can be much larger 
than the statistical uncertainties in the individual determi- 
nations. For example, for as = 0.5 and dc = 30 h~^Mpc , the 
largest variation seen in Table 1 is 1.46 /i~^Mpc, compared 
with atypical uncertainty of ~ 0.16/i~^Mpcin the individual 
determinations. 

3.2 Redshift space efFects 

When redshift distances rather than true distances are used, 
the correlation function is distorted in various ways. Struc- 




5 10 15 20 

r/(h-iMpc) 



Figure 6. Contours of constant ^cc for a CDM model with erg = 
0.63. Results are shown for SO^ clusters with dc = 20/t^^Mpc. 
The levels are at 0,0.2,0.5,1,2,3 and 4. Real space correlations 
are plotted as full lines and redshift space correlations as dashed 
lines, with the ^cc = 1 contour in bold. 



tures on small scales are smeared out by peculiar velocities, 
while structures on large scales are amplified by coherent 
infall (Kaiser 1987). As a result, the correlation function in 
redshift space is flatter on small scales, steeper on interme- 
diate scales and has a larger amplitude on largo scales than 
the real space correlation function. These effects are read- 
ily apparent in Fig. 6 which shows contour plots of ^ 

cc as 

a function of projected separation on the sky, r-p, and dis- 
tance along the linc-of-sight, r^. For small values of Vp, the 
contours of constant gcc are elongated along the direc- 
tion because of smearing and for large values of rp they are 
elongated along the rp direction as a result of coherent infall. 

A selection of our redshift space correlation functions 
arc plotted and compared to their real space counterparts 
in Fig. 4. In all cases, the net effect of redshift space dis- 
tortions is to increase the amplitude of the correlation func- 
tion on scales 3<s/( /i~^Mpc)<30. As a result, the values 
of So (the redshift space separation at which 5cc(s) — 1) 
are significantly larger than the corresponding values of ro. 
As expected, the differences are greater for larger values 
of as since the induced peculiar velocities grow with the 
amplitude of the mass fluctuations. These effects arc fur- 
ther demonstrated in Tables 1 and 2. The enhancement of 
the correlation length in redshift space depends somewhat 
on the group-finder and selection statistic used. On aver- 
age. So is larger than ro by l.lft~^Mpcfor as = 0.5 and by 
1.4/t^"'^Mpc for as = 0.63. Note, however, that in individual 
cases the redshift space enhancement can be considerably 
larger than this. 

The increase in the correlation length with increasing 



© 0000 RAS, MNRAS 000, 000-000 



Cluster Clustering 9 



15 - 



10 - 



5 - 

- A Real sj 

- i Real sf 

- o Redshi: 

- • Redshi; 

I ' ' ' — 

20 40 60 80 

dy(h-iMpc) 

Figure 7. Variation of the rcdshift space correlation length, so, 
with mean intercluster separation, dc, for a selection of cluster 
catalogues. Data arc for SO groups selected according to velocity 
dispersion. Open symbols show results for as = 0.5 and filled 
symbols for trg = 0.63. Circles represent our redshift space results 
and triangles the corresponding real space results from Fig. 5 
which are included for compajrison. 

intercluster separation is slightly more pronounced in red- 
shift space than in real space. An illustrative case, SO„ clus- 
ters, is shown and compared with the corresponding real 
space data in Fig. 7. For as = 0.63, so grows approximately 
linearly with dc out to the largest values of dc considered, 
dc = 70/i^^Mpc, at which so ~ 15/ir^Mpc. The relation 
between so and dc is approximately given by 

so = (0.063 ± 0.020)dc -|- (7.31 ± 0.72) h'^Mpc (7) 

and 

So = (0.109 ± 0.007)dc + (6.87 ± 0.35) h'^Mpc (8) 
for as = 0.5 and 0.63 respectively. 



4 COMPARISON WITH PREVIOUS 
SIMULATIONS 

Cluster correlation functions calculated from N-body simu- 
lations of the CDM model have been published previously 
by White et al. (1987), Bahcall & Cen (1992), and Croft & 
Efstathiou (1994). The first of these studies sampled a rel- 
atively small volume so the resulting correlation functions 
have large uncertainties; they are consistent with our results. 

Bahcall & Ccri (1992) simulated a single cube of side 
400/i~^Mpc , in the standard CDM model, and calculated 
cluster correlation functions in real space only (Cen, private 
communication). They identified groups of particles using 



20 40 60 80 

dy(h-iMpc) 

Figure 8. Comparison of simulation results from different stud- 
ies. The plot shows the real space correlation length, ro, as a func- 
tion of the mean intercluster separation, dc. The dashed line gives 
the results of Bahcall & Cen (1992) which should be compared 
with our results for FOFjvf clusters (circles). The solid line gives 
results from Croft & Efstathiou (1994) which should be compared 
with our results for CE(0.5) (triangles). Within the statistical er- 
rors, our results agree with the two other studies even though 
these are inconsistent with one another. 

a variant of our POP algorithm and selected clusters ac- 
cording to mass. Thus, for a fair comparison, we consider 
our catalogue of FOPm clusters. The comparison is done in 
Fig. 8 where we plot correlation lengths in real space as a 
function of mean intercluster separation. For small values 
of dc (corresponding to relatively poor clusters), Bahcall & 
Cen's results lie only slightly below ours and for interme- 
diate values the agreement is very good. For the rarest ob- 
jects (dc>50/i~^Mpc), Bahcall & Cen apparently find that 
the ro-dc relation continues to increase linearly whereas our 
relation flattens off. Unfortunately, Bahcall & Cen do not 
quote uncertainties in their estimates but, since the volume 
they simulated is only about 40% of the volume we have sim- 
ulated, their error bars will typically bo ~ 1.6 larger than 
ours and so the disagreement at large values of dc is only 
marginally significant. 

In Fig. 8 we also compare our real space results with 
those of Croft & Efstathiou (1994). Like Bahcall & Cen, 
they examined only one cluster selection algorithm (CE in 
Section 2) which we deliberately included in our list for com- 
parison purposes. Again, the agreement is very good, partic- 
ularly for dc ~ 40/i~^Mpc, where the error bars are small. 
For larger values of dc the results are still statistically con- 
sistent although Croft & Efstathiou's values lie somewhat 
below ours (Croft & Efstathiou do give error bars but, for 
clarity, we have omitted these in the figure; they are slightly 
smaller than ours.) As pointed out by Croft & Efstathiou, 



m 
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Figure 9. Comparison with the results of Croft & Efstathiou 
(1994). Correlation functions from the present study are shown 
by squares and those from Croft & Efstathiou by triangles. Open 
symbols show data in real space and filled symbols in redshift 
space. All correlation functions are for CE(0.5) groups with = 
30/i-lMpc, but our simulations have as = 0.63 whereas those of 
Croft & Efstathiou have as = 1-0. 



their results disagree with those of Bahcall & Cen, except 
for the smallest values of dc- Croft & Efstathiou suggested 
that this discrepancy could be due to a statistical fluctuation 
in Bahcall & Cen's single simulation. The two simulations 
also differ in their value of as (0.77 in the case of Bahcall 
& Cen; 0.59 in the case of Croft & Efstathiou; and an in- 
termediate value in our case). However, as Fig. 7 shows, the 
effect of as on the real space correlation length is too weak 
to account for the difference between the results of Bahcall 
& Cen and those of Croft & Efstathiou. Fig. 8 suggests that 
the discrepancy might be caused by the use of different clus- 
ter selection criteria. When we use similar scloctiou criteria, 
we find reasonably good agreement with both studies. 

The previous comparison referred exclusively to corre- 
lation lengths in real space. Bahcall & Cen do not give any 
results in redshift space, but Croft & Efstathiou do and a 
comparison of their results and ours (for the same cluster 
selection criteria) is made in Fig. 9. Here we plot the full cor- 
relation functions for CE(0.5) clusters with dc = 30ft~^Mpc , 
both in real space (open symbols) and in redshift space 
(filled symbols). The agreement in real space is excellent, 
confirming our earlier conclusion from Fig. 8. In redshift 
space, on the other hand, there are some discrepancies, par- 
ticularly at small and intermediate pair separations, where 
our correlation function lies systematically above Croft & 
Efstathiou's. The differences are small but significant given 
the small quoted errors. For example, at s = 6.3h~^Mpc , our 
value of ^cc is about 60% higher than Croft & Efstathiou's. 
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Figure 10. Comparison between our CDM cluster correla- 
tion function and data for X-ray selected clusters. The squares 
show our redshift space results for SOXj^^ clusters with dc = 
40h~^Mpc and as = 0.63. The triangles show the correlation 
function estimated by Romer et al. (1994) for a sample of clus- 
ters selected using a combination of ROSAT X-ray data and opti- 
cal data. The upper panel shows logj^Q |$c(s)|, with open symbols 
corresponding to values of icc{s) < 0. The lower panel is a linear- 
log plot of the region where ^cc < 1 which shows the large scale 
behaviour more clearly. 



These differences are likely to be an underestimate since our 
simulation has as = 0.63, whereas theirs has as = 1 and, 
as we have seen, in redshift space Ccc(s) is larger for larger 
values of as . Apart from this difference in as , the only other 
difference between the two analyses is the sampling strat- 
egy in the computation of ^cc(s)- Our estimate is based on 
a straightforward computation using all the clusters in the 
simulations satisfying the selection criteria, whereas Croft 
& Efstathiou averaged over several realizations of subcata- 
logues with the same abundance and selection function as 
the APM cluster catalogue of Dalton et al. (1992). Provided 
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Figure 11. Comparison between CDM cluster correlation func- 
tions and data for optically selected clusters. The triangles show 
correlation functions for APM clusters; filled triangles for the 
dc = 31/i~^Mpc sample of Dalton et al. (1994), and open triangles 
for the dc = 45ft~^Mpc sample of Efstathiou et al. (1992). The 
open circles show the correlation function estimated by Nichol 
et al. (1992) for the dc — 50 ft~ ^ Mpc clusters in the Durham- 
Edinburgh survey. The dashed line shows our model results for 
SO^ clusters, with dc = 30/i^^Mpc, and the solid line for SOt, 
clusters with dc = 50/i~^Mpc. All model correlation functions 
were calculated in redshift space. 

the correct selection function is used this procedure will give 
an unbiased estimate of the true correlation function in the 

simulation as a whole - the quantity which wc have calcu- 
lated directly. Possible explanations for the relatively small 
discrepancies in Fig. 9 are residual systematic differences re- 
sulting from different simulation techniques and the choice 
of erg. 

In summary, apart from the small differences in redshift 
space just mentioned, the cluster correlation functions in our 
study agree well with previous published work, provided the 
comparison is made for cluster catalogues identified and se- 



lected tn similar ways. The apparent disagreement between 
the work of Bahcall & Cen (1992) and that of Croft & Ef- 
stathiou (1994) appears to have been largely caused by a 
different choice of group-finding algorithm. 



5 DISCUSSION AND CONCLUSIONS 

BahcaU & Cen (1992) and Croft & Efstathiou (1994) com- 
pared their simulation results to real data and in both cases 
concluded that the correlation function of clusters is incom- 
patible with the standard CDM model, but is consistent 
with a low density CDM model with Q,h ~ 0.2. At first sight 
this consensus seems rather surprising since, as Fig. 8 shows, 
the predicted cluster correlation functions in these two stud- 
ies disagree. The explanation is simply that the comparison 
in each case was made against different datasets. Croft & 
Efstathiou compared their models to the APM cluster cat- 
alogue of Dalton et al. (1992) whereas Bahcall & Cen com- 
pared theirs to this catalogue and to Abell's catalogue as 
well. As may be seen from Fig. 3 of Bahcall & Cen (1992), 
their cluster correlation function for the low density model 
does not agree particularly well with the APM data and, as 
can be seen from Fig. 4 of Croft & Efstathiou (1994), their 
low density model strongly disagrees with the Abell cluster 
data. Thus, the two studies were able to arrive at the same 
conclusion because they compared different theoretical pre- 
dictions for the same cosmological models against datasets 
that exhibit different clustering properties. 

We have found here that even in the idealized case 
where clusters are identified in the three-dimensional mass 
distribution of a simulation, significantly different outcomes 
for the cluster correlation function are possible depending 
on how exactly the clusters are defined and on how the data 
are analyzed. It is unclear which, if any, of the various pos- 
sible definitions of clusters in the simulations is appropriate 
for a comparison with the real data. This difficulty is par- 
ticularly severe in the case of optical catalogues since the 
identification of clusters in the projected galaxy distribu- 
tion is very different from the identification of clusters in 
the three-dimensional mass distribution of a simulation. Bi- 
ases in ^cc arising from projection effects have been shown 
to be present in Abell's catalogue (Sutherland 1988; Efs- 
tathiou & Sutherland 1991; Dekel et al. 1989). The lack of 
large anisotropics in ^cc for APM clusters suggests that this 
catalogue is largely unaffected by biases of this kind, but this 
important feature by itself does not remove the ambiguity 
regarding the identification of galaxy clusters in real cata- 
logues with mass clusters in simulations. Identifying cluster 
populations in both by matching the spatial number density 
is not a unique procedure since, as we have shown, even at 
a fixed number density, the cluster correlation function de- 
pends, for example, on the statistic used to rank the clusters. 
In practice, it seems likely that even larger uncertainties will 
be introduced by the difficulty of determining the richness 
of clusters in projection and by the associated uncertain- 
ties in the estimation of their spatial number density. X-ray 
selected clusters provide, in principle, cleaner observational 
samples, but even in this case the comparison with theoret- 
ical models is restricted by the lack of reliable predictions. 

Figs. 10 and 11 illustrate how some of the uncertainties 
we have mentioned can affect the confidence with which a 
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specific model is constrained by measurements of ^cc- Here 
we compare predictions based on tlie standard cold dark 
matter model - the model which Bahcall & Cen (1992), 
Croft & Efstathiou (1994) and Dalton et al. (1994) claim 
to be strongly excluded by the cluster correlation function 
data - with various observational determinations of ^cc- In 
Fig. 10 we compare our estimates for SOX "X-ray selected" 
clusters with the ROSAT data of Romer et al. (1994). This 
catalogue is not volume-limited and thus contains clusters 
with a range of intrinsic X-ray luminosities. The simulations, 
however, indicate that the variation of the correlation length 
of SOX "X-ray selected" clusters with cluster X-ray lumi- 
nosity (or richness) is small compared with the uncertainties 
in the data. Within these large statistical errors, the agree- 
ment is good except, perhaps, on the largest scales where the 
observed signal is small and could be affected by systematic 
uncertainties in the mean number density of clusters. The 
size of the discrepancy on large scales may be better appre- 
ciated in the linear-log plot in the lower panel of this figure. 

In Fig. 11, we compare our estimates for SO„ clusters 
(in redshift space) with data for "optical" clusters from the 
APM (Efstathiou et al. 1992, Dalton et al. 1992, 1994) and 
EDCC (Nichol et al. 1992) catalogues. The APM sample 
shown by filled triangles has a mean intercluster separation, 
dc ^ 31/i~^Mpc, and should be compared with the dashed 
line which shows our model predictions for the same mean 
intercluster separation. The APM sample shown by open 
triangles has dc — 45/t^^Mpcand the EDCC sample has 
dc ^ 50/i~^Mpc. These should be compared with the solid 
line which shows our model results for dc = 50/i~'^Mpc. 
On intermediate scales only the denser sample (which has 
the smallest error bars) is inconsistent with the model, but 
the discrepancy is quite small and certainly much smaller 
than the discrepancy found by Dalton et al. (1994) for the 
same model (cf their Fig. 4). The reason for this difference 
is simply that the group-finder applied to the simulations by 
Dalton et al. happens to give one of the lowest correlation 
functions of all the group-finders that we have explored in 
this paper. On large scales there is an indication that the 
data are more clustered than the models and, again, the 
linear-log plot clearly shows that this discrepancy is small. 

To summarize, large N-body simulations allow very pre- 
cise estimates of the cluster correlation function once a spe- 
cific prescription for identifying clusters is adopted. For a 
given cosmological model, the statistical uncertainties in 
these predictions are small compared to the observational 
errors for existing cluster samples. Unfortunately, they are 
also small compared to the systematic variations exhibited 
by cluster samples constructed from the simulations by mak- 
ing different assumptions. Our analysis shows that the exact 
form and amplitude of the correlation function of clusters 
identified in the mass distribution of N-body simulations 
depends on various factors. In rough order of importance 
these include: (i) the group-finding algorithm and the statis- 
tic used to rank clusters in a catalogue (eg. mass, velocity 
dispersion, "X-ray" luminosity, etc); (ii) the mean density 
of clusters in the catalogue; (iii) whether clustering is mea- 
sured in real or in redshift space; and (iv) the assumed value 
of Us. These various factors can produce large variations in 
the resulting clustering length. For example, in the range 
most relevant to observational data, 30 < dc < 60/i~^Mpc, 
our "X-ray selected" catalogues in a model with as = 0.5 



have real space clustering lengths varying between 7 and 
10/i~^Mpc, whereas catalogues selected by velocity disper- 
sion in a model with erg = 0.63 have redshift space correla- 
tion lengths varying from 10 to 13 /i~^Mpc. 

Of the four complicating factors listed above, only the 
first two refer directly to cluster catalogues. The third one 
should be straightforward to eliminate but it has sometimes 
been ignored in comparisons of model predictions with data 
(eg. Bahcall & Cen 1992). Similarly, the value of as appro- 
priate to a given cosmological model is usually fixed from 
other considerations such as the amplitude of fluctuations 
in the temperature of the cosmic microwave background or 
the abundance of galaxy clusters (White, Efstathiou & Frcnk 
1993). There arc, however, uncertainties associated with this 
procedure arising, for example, from possible contamination 
of the microwave background signal by tensor modes or un- 
certainties in the masses of clusters. 

The resolution of the cluster clustering debate will re- 
quire further observational and theoretical work. From the 
observational point of view, progress will come from the 
analysis of large homogeneous samples of clusters selected 
entirely from X-ray data or from large redshift surveys such 
as the forthcoming SDSS and 2df galaxy surveys. From the 
theoretical point of view, it will be necessary to model in 
detail the selection procedures employed by observers. Arti- 
ficial catalogues constructed from cosmological simulations 
are a valuable aid, but several complications need to be 
borne in mind. For example, to model the selection of APM 
clusters, it is necessary to simulate the entire APM galaxy 
survey and this, in turn, requires modelling the uncertain 
connection between the distribution of dark matter and the 
formation sites of galaxies. Modelling cluster catalogues con- 
structed from X-ray data is a simpler problem theoretically 
since it bypasses the complications Eissociated with galgixy 
formation. Nevertheless, it requires a better understanding 
of the mccharusms that determine the total cluster X-ray 
lununosity than is available at present. It is our view that 
until the theoretical predictions are brought onto the ob- 
servational plane, measurements of the cluster correlation 
function cannot confidently be used to choose amongst com- 
peting cosmological models. 
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